# MSP-Podcast Dataset

SER Odyssey Baseline WavLM Arousal

A speech emotion recognition (SER) baseline model built on the WavLM architecture that predicts the arousal of speech as a value in the 0-1 range.

Audio Classification, Transformers, English
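
As a rough illustration of how a score in the 0-1 range reported by the arousal model above could be read, the sketch below maps such a score onto a rating scale and back. The 1-7 scale bounds and the function names `unit_to_rating` and `rating_to_unit` are assumptions made for this example; the listing itself only states the 0-1 output range.

```python
def unit_to_rating(score: float, low: float = 1.0, high: float = 7.0) -> float:
    """Map a score in [0, 1] onto an assumed [low, high] rating scale."""
    # Clip first, since a regression output can drift slightly outside [0, 1].
    clipped = min(max(score, 0.0), 1.0)
    return low + clipped * (high - low)


def rating_to_unit(rating: float, low: float = 1.0, high: float = 7.0) -> float:
    """Inverse mapping: bring a rating on [low, high] back into [0, 1]."""
    return (rating - low) / (high - low)


if __name__ == "__main__":
    # Hypothetical arousal scores, not real model outputs.
    for s in (0.0, 0.5, 1.0):
        print(s, "->", unit_to_rating(s))
```

Under these assumed bounds, a score of 0.5 corresponds to the midpoint of the rating scale; a different scale only requires changing `low` and `high`.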

SER Odyssey Baseline WavLM Dominance

A speech emotion recognition (SER) baseline model built on the WavLM architecture that predicts the dominance of speech.

Audio Classification, Transformers, English

SER Odyssey Baseline WavLM Multi Attributes

A multi-attribute speech emotion recognition (SER) baseline model built on the WavLM architecture that predicts three dimensions of speech: arousal, dominance, and valence.

Audio Classification, Transformers, English

© 2025 AIbase